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ABSTRACT 

The greater part of this paper is dedicated to a 
non- technical discussion and criticism of the orinciples of 
Skinnerian behaviorism. Various aspects of the theory are examined, 
and its inability to deal with verbal behavior as a productive and 
creative activity is asserted. The author’s point of view is that 
expressed by Noam Chomsky in his criticism of Skinner's "Verbal 
Behavior." Evidence is produced to demonstrate that, contrary to the 
assertions of beha viorists, innate skills are involved in first 
lanquage learning. The author stresses that although it can therefore 
be supposed that every child learns his first language in the same 
way, independently of 10 or culture or training, there is no evidence 
that second language learning is one particular kind of task 
approached In the same manner by all learners. For this reason the 
author believes that there is no basis for believing that there is 
the possibility of finding a theory that can provide a universally 
valid technique for language teaching; he stresses instead the 
pragmatic aspects of language instruction. Finally, in considering 
how linguistics and psychology can help the language teacher, the 
author concludes that "Linguistics and psychology have nothing to say 
about the methods used in foreign languace teaching." (FWR) 
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I Mill begin by saying something about the Chomeky-Sklnner 

business, and then I'll talk opeh-endedly about recent results 

in psycholinguistios proper* I think in a way that the Chomsky- 

Skinner disousslon and the whole operant model of language lc a 

dead issue, and that it is relevant to almost nothing, but 

people who make materials for teaohlng you to teaoh language 

and people who make gadgets — -controlled environments, as they 

are sometimes oalled— for language teaohlng all olalm that 

they're based on soientlfio principles* The scientific principles 

they have in mind are usually the ones I'll be talking about* 

As I say, I think it all oomes to very little, but it's worth* 

0} while knowing the Jargon* 

00 

uO 

Skinnerian psyohology is based on a sort of modular system 1 

© for running psychological experiments* The modular system 
© 

consists, first of all, of something oalled a Skinner box* In 
• ^4 • 

. the box you put a rat, or a pigeon later on in the development 

of this system. The box is dosed, so that the experimenter is 

* 

not allowed to watch the organism behave, since if hs were 

o 
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allowed to watoh the organism behave, that might bias hie views 
on what the behavior was like. So the rat's behavior is reoorded 
eleotronloally. 

You begin with a rat in this box and nothing else —just a 
rat in a completely neutral environment. You start out by 
making the rat hungry, by subjecting him to about 24 hours of 
food deprivation. Then you run into this box a bar j under the 
bar is a trap that food pellets oan drop into. Outside the box 
there is a dispenser for food pellets. And then this system is 
oonneoted to a graph whioh will allow you to measure the 
frequenoy of the rat's responses «*s a funotion of time. Now, 
in this situation, if the rat is hungry enough, in walking around 
the box he will at some point happen to bang into the bar. The 
food dispenser is set up so that whenever he bangs into the bar, 
a food pellet will drop into the trap* The graph will now 
allow you to measure the frequenoy with whioh he bangs into the 
bar over timet one coordinate of the graph reoords the responses 
and the other coordinate reoords the time elapsed. What you 
find is that the rat moves along at eero response level until 
he hits the bart at this point the ourve goes up a little and 
maybe flattens out. For a while, little happens, and then the 
rut hits the bar again, gets another reinforoement, and the 
ourve goes up again, and then levels out* This happens a few 
times, and then all of a sudden, the rat, as you might say, gets 



the Idea, and the response wave takes off and flattens out at 
some fantastically high number. What has happened Is that the 
rat has learned in some sense that he oan get a pellet by press- 
ing the bar. And that pieoe of learning, as it were, shows up 
objectively in the inorease of the response rate, as a funotlon 
of time. 

This is all very straightforward. In faot, if it doesn't 
happen --if the rat doesn't dlsoover the bar— then, when 
nobody' 8 looking, you oan open up the box and push the bar down 
for the rat, and he'll notloe that the food pellet oomes and 
will start off that way. 

What makes it possible to turn this finding into psychology 
is the possibility of manipulating various parameters of this 
situation and looking at how the curve goes as a function of 
these manipulations. Millions and millions of Ph.D. theses 
have been done with this sort of modular pieoe of equipment in 
exactly that way. For example, you oan reinforce the rat, say, 
every thirteenth time he presses the bar instead of every time 
he presses the bar, or you oan reinforce him only during selected 
temporel siloes (that is, for 5 minutes he get no reinforcement 
when he presses the bar and then he gets reinforcement any time 
during thirty seconds, and then for another five minutes he gets 
no reinforcement, and so on). What we find is that the shape of 




the curve is a function of such kinds of variables! the frequenoy 
of reinforcement f the hunger of the rat t the amount of reinforce- 
ment, the amount of foroe that is required for him to depress the 
bar.,. And sinoe there are in principle Indefinitely many values 
of those variable?, you oan put out a lot of Ph.D. theses from 
this kind of design, and that's more or less what has happened. 

What ooours in this situation when the rat learns to press 
the bar under these various conditions is oalled operant learning, 
and the whole situation is oalled an operant paradigm. 

It is possible to Introduce another wrinkle into this 
system. If there is a second stimulus, this will provide a 
further condition whloh the rat will have to satisfy in order 
to get relnforoed, For example, suppose we put on the wall of 
the box two lights, a red light and a green light —I don't 
know whether aotually rats are oolor-bllnd, but let's assume 
they're not— and we set things up so that the rat will be 
relnforoed if he presses the bar when the red light is on, but 
not when the green light is on. So now relnforoement is 
contingent on two things —the bar-press and the presenoe of 
the red light. In that oase, the red light is oalled the 
discriminated stimulus! that is, the plus oondltion, the one 
for whloh he gets relnforoed. The other one is oalled the 
negative discriminated stimulus. 
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Now, the interesting thing about this --not wildly surprising, 
but interesting — is that after a oertain amount of training of 
this kind, a oertain correlation develops in the behavior of 
the rat. If he f s reinforoed only in the plus condition and not 
in the minus oondltlon, then what you find is that the blood 
pressure and behavior of the rat beoomes oorrelated with the 
presence of the discriminated stimulus | that is, he in faot 
presses the bar when the light is red, and he doesn't press it 
when the light is green. This is known as bringing the behavior 
of the organism under the oontrol of the discriminative stimulus. 

There is one further oomplloatlon that you oan introduoe 
into this sort of situation. Suppose that you use a light of x 
angstrom units for a oondltlon, and you train the rat to respond 
to the preaenoe of light of this partioular shade. Now, in a 
later trial, after training has taken plaoe, you oan test to 
8e what happens if for this red light of x angstroms you 
substitute a red light of g angstroms, with x not equal to 
What happens is pretty muoh what you'd expeot. Call the light 
of y angstroms a generalized stimulus, as distinct from the 
stimulus on whloh the rat was trained. If you make a graph of the 
response strength as a function of the quantifiable difference 
between the generalized stimulus* and the discriminative stimulus, 
a regular behavioral law sets in. If we take the origin to be 

i 

the discriminative stimulus and we put generalized stimuli at 
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various distances from the origin* and if this lnteroept is the 
trained response rate for the discriminated stimulus* then* 
roughly speaking* the ourve falls off as a funotlon of the 
dlstanoe between the generalized stimulus and the d..sorimlnatod 
stimulus. So you oan set up a law of behavior whloh relates 
the similarity between the originally trained stimulus and the 
generalized stimulus to the rate of reinforcement* and that 
states that the extent of generalization* (from the discriminated 
to the generalized stimulus) varies Inversely with the similarity 
of the stimuli. The more dissimilar they are* tho less transfer 
training and the less generalization you get. This Is* of 
oouree* a fairly obvious principle* That Is* If I teaoh my dog 
to oooe when I whistle In one way* then the probability that 
he* 11 oone when I whistle In another way la a funotlon of the 
similarity between the two whlstlesi the more dissimilar they 
are* the lower the probability that the dog will oome. 

Well* that Just about exhausts the oonoeptual oontent of 
the Skinnerian system* What I want to do now Is to say some* 
thing about how language looks if you try to reoonatruot it on 
this kind of model. 

To apply this model* you have to assume that language 
consists essentially of a set of habits* and that learning a 

4 

language consists essentially of learning a set of habits* 
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beoause what the organise is doing in the Skinnerian view is 
learning to respond in a oertain way under oertain conditions 
to develop a habit. 3o the Skinnerians start off by saying 
that there are two kinds of habits, suoh that the possession 
of those two habits oonstitutes the knowledge of the language. 
The first is the habit of making a oharaoteristio kind of verbal 
response in the presenoe of a oharaoteristio kind of verbal or 
non-verbal stimulus. That is, you have a discriminated stimulus 
whloh may be either verbal or non-verbal, and you have a habit 
of making a oertain kind of verbal response in ti e presenoe of 
that stimulus. 

So, for example, I presumably have the habit of saying 
"©rayon" in the presenoe of a orayon, and the habit of saying 
"glass" in the presenoe of a glass* As an example of a verbal 
stimulus, X presumably have the habit of saying "you* re welcome* 
when somebody says "thank you." Sc, on this analysis, there*s 
a discriminated stimulus --for example, somebody saying "thank 
you," or, for that matter, the presenoe of a glass— and I've 
learned a oertain verbal response to the ooourrenoe of these 
stimuli i in the first oase X say "you*re weloome," and In the 
seoond X say "glass*" So, this is one kind of habit —the habit 
of making a oharaoteristio verbal response in the presenoe Of 
e oertain kind of verbal or non-verbal etlaulus* 




The seoond kind of habit that la assumed to go into 
knowing a language is that of making a oharaoterietlo non- 
verbal response in the presenoe of a oertaln kind of verbal 
stimulus. For example, somebody says, "Do you have a light?" 
and I have the habit of taking out my lighter and lighting it. 

Or someone says, "Pass the salt," and I have the habit of passing 
the salt if there happens to be any around. (Or, for that matter 
presumably if someone says, "Where were you bom?" 1 nave the 
habit of saying "New York.") So that’s a oolleotlon of habits 
whioh oonslsts of a oharaoteristio 'disposition to provide 
verbal responses to oertaln non-verbal stimuli, and non-verbal 
responses to oertaln verbal stimuli. And the suggestion is 
that when I have developed this set of habits, I know English. 
That * 8 what it is to me to know friglish) namely, to have the 
appropriate habits at these times. 

This view of language oan be mapped onto the oonoeptual 
apparatus of the Skinnerian system. Consider first the habit 
of produo ing a oertaln verbal response to a oertaln non-verbal 
stimulus. What the Skinnerians* story has to say as a first 
approximation is that 1 got that habit by being relnforoed when 
Z produoed the appropriate non-verbal response when the non* 
verbal stimulus was In the area. So, when I was a little boy, 

Z happened to say "glass" when a glass was around, and my Noisy 

t 

patted me on the head, and that*s how Z picked up the habit of 



spying "glass" when a glass is around* Now of oourse that oan't 
be exactly true, nor does Skinner suppose It is, beoause the 
probability of my happening to make the verbal gesture "glass" 
in the environment of a glass Is presumably negligibly small, 
when I*m a H-year-old baby. Or if my grandfather oomes into 
the room, I have to say, "That 1 s good old grandfather," if I'm 
going to get reinforoed. But the probability of an untrained , 
li-year-old baby happening to make the noise, "That*s good old 
grandfather," is presumably infinitely small* 

So it oan't be that one merely sits around and waits for 
the baby to say, "That's good old grandfather," in the way that 

one sits around and waits for the rat to press the bar* This 

requires a further notioni operant shaping (this is how you get 
a pigeon to play tennis)* Suppose that instead of waiting for 
the rat in the Skinner box to press the bar, we put him in a 

oage and he wanders around in the general direotion of the bar| 

what we want him to do is to press the bar. So what we do is 
everytiae he heads in the direotion of the bar, we give him a 
pellet. We rapidly find that the frequency with whloh he heads 
toward the bar will inoreaae. Now we ohange the oondition of 
reinforoement, so that he only gets a pellet when he gets within 
a oertain area fairly olose to the bart and we find that the 
frequency with whloh he gets within that area of the bar 

i 

increases. Now we change the sohedule of reinforoement again, 

so that he gets reinforoed only when he's within a oertain area 

o 

ERIC 



even oloser to the bar. By gradual shaping of his behavior, we 
eventually get him to the position where we want him --he bangs 
the bar; and we shift the schedule of reinforoement so that he 
is reinforoed only when he aotually hits the bar. So now we can 
have a pleoe of behavior that Is not In faot originally exhibited 
at any high frequency at all, beoomlng quite regular. 

Suppose we wanted to get the rat to stand on Its head —I 
doubt if he can, but suppose we wanted to. Well, presumably he 
never stands on his head In his normal ecology, because that's 
just not one of his trioks. So what we do Is give him a pellet 
first when he puts his head down a little; then, when we've got 
him putting his head down very frequently, we give him a pellet 
only when he happens to hit, the floor when he puts his head 
down. And so forth and so on, until by some set of successive 
approximations, we give him a pellet only when he stands on his 
head, in case he ever happens to do so. 

Well, it's exactly this kind of operant shaping that goes 
on when Kommy gets the baby to say "glass," When the baby is 
brought Into an environment in which there happens to be a glass, 
in the course of his random babbling he will sometimes make 
sounds like "NGMPPP" or"GIHH." These sounds would be set up td 
be reinforced in the presence of a glass. That is, Mommy will 
pat him on the head and give him an M&M or whatever mothers do, 



rfhen he makes that sound. When he Is regularly making this 
sound in the presenoe of a glass, we oan shift the orite.rla of 
reinforcement, so that. we only reinforoe him for those "GIHH" 
sounds whioh are very olose to the first few sounds of "glass.* 1 
If we keep shifting our orlterla of relnforoement to response 
that are suooesslvely oloser to the oorreot pronunciation of 
"glass? over a long enough period of time, presumably we will 
shape the baby's behavior, so that he now says "glass” and 
nothing else In the presenoe of the discriminated stimulus of 
a glass. (You oan play this another way, and some Skinnerians 
are Inclined to. You oan olalm that on top of the apparatus 
described above, the ohlld also has an innate tendenoy to 
Imitate. In this oase, one oould assume that the parent says 
"glass” when a glass is around and the ohlld tries to imitate 
that, and then the quality of his Imitation and his tendenoy to 
say "glass” are shaped by relnforoement. This gets you off the 
ground muoh faster.) 

The other set of habits —making a speolfio kind of non- 
verbal response to a speolfio kind of verbal stimulus— are 
treated in a similar fashion. For example, when I was three 
years old, somebody said, "Pass the cigarettes," and I passed 
the cigarettes and got patted on the head, and this happened 
with suffioient frequenoy that 1 developed the habit of pt^dubihg 
the non-verbal response of passing the cigarettes in the presenoe 



of the verbal discriminated stimulus of somebody saying "Pass 
the oigarettes," 

Now here is where generalization oomes in. You might 
suppose, on the basis of the kind of account presented above, 
that I would learn to say ’’glass" in the presence of a glass, 
but not when this same glass is turned upside down. But that 
is overed by the generalization clause, which says that, given 
that I've been trained to say "glass" in the presenoe of a glass 
the probability of my saying "glass" in the presenoe of an 
upside-down glass is a funotlon of the similarity between the 
two, in some rather grossly undefined sense of similarity. 

The principle of generalization also aooounts for how one 
produoes a response to stimuli which are dlstlnot but similar. 

< t r- 

For example, I'll pass the cigarettes whether I'm asked to do so 
by an adult, who produoes a certain kind of aooustio pattern 
when he says, "Pass the oigarettes," or by a three-year-old, 
who produces a different kind of acoustio pattern. This is 
accounted for by the principle of generalization, which says 
that my training transfers from one stimulus to other stimuli 
of characteristically similar kinds. 

Well, that is essentially —with certain mildly offensive 
aberrations, perhaps — what the Skinnerians' account of language 



look 8 like. What you have is a oonoeptual neohanlsm of saying 
how language is put together t for saying how the sot of habits 
oharaoterietio of having a language are assimilated —given 
that you assume that having that set of habits is what is 
involved in having language. And, in a sense, there Is Just 
nothing el se to say. You've explained what understanding a 
sentenoe is —that is, being disposed to produoe the appropriate 
non-verbal response, given the sentenoe that has been uttered) 
and you've explained what speaking the language is —that is, 
being disposed to produoe oertaln verbal operants, given the 
appropriate input stimulation. So, now that you've said what 
it 1 8 to understand the language and to speak it, you've said 
everything there is to say. 

(Suppose you wanted to apply this kind of system to a 
language learning situation. You would put somebody in the 
following condition) you show a glass on a soreen or something, 
and through a loudspeaker you say "glass." If he says "glass" 
properly, you give him an M4M or something) or you wait around 
for him to say something that sounds like "glass" and then give 
him an HAM, and eventually he ends up oalling a glass a glass. 
Now, in a oertaln sense, I can't really see what else you oan 
do. If you want to teaoh somebody what a glass is called, and 
the game is so played that you're not allowed to say, "Look, this 
is oalled 'glass, '"for some methodologioal reason —because it's 

o 

ERIC 



not considered solentiflo or something-- then about all you oan 
do is wait around for. him to say "glass" and tell him "you’re 
right" when he says it. This either works or it doesn’t.) 

But what I want to suggest is that the view of language 
on which this kind of story is based is Ju'it hopeless. It 
seems immediately obvious that these kinds of habits aren’t 
really what's at issue. Let me just make a couple of crude 
points and then go into this in detail. It is perfeotly obvious 
that in faot I don’t have the habit of saylug "glass" whenever 
a glass is presented, because I don't say "glass" whenever a 
glass is presented. There are billions of things of all kinds 
in the environment — people, faces, one's nose, and so forth— 
whioh are allegedly stimulus inputs but which you don’t name. 
Now, that's a sort of banal point, but it’s very important. 

When you talk a language, your behavior vis-a-vis the stimuli 
in your environment is very different from the rat's behavior 
vis-a-vis the bar-press. Once the rat has learned its trick, 
it presses the bar whenever there’s a ohanoe that this will 
lead to a reward. Language doesn’t work like that. There 
simply isn’t any specifiable set of response habits suoh that 
one oan rely upon a speaker produoing a oertain utterance given 
that a oertain stimulus situation has been presented, in the 
way that you can rely on a rat pressing the bar after a oertain 
period of training. 



Nox, this is so self-evident that Skinner even noticed it 
in Verbal Behavior, and what he says is that we oan bring verbal 
behavior under dlreot stimulus oontrol, in the following wayt 
we take a guy and we bring him into an absolutely bare room, 
in whioh the only notloeable object is a penoil 16 feet tall. 
Now, Skinner says that it's under those blroumstanoes that it's 
highly probable that the ,uy will say "pSnoil," This, is 
probably oorreot. The question is whether or not one wants to 
treat that as a paradigm of an lnstanoe in which one uses the 



language. As another paradigm of such tanoe, suppose I'm 

a lot bigger than you are, and 1 say "Say 'uncle* or 1*11 break 
your arm.” The probability under those olroumstanoes is very 
high that if you believe me, you* 11 say "unole," Or, I approach 
you with a large pistol and say, ”Say *unolo* or 1*11 blow your 
brains out.” And, depending on your view of what my oharaoter 
is, you*re very likely to say ”unole” under those olroumstanoes. 

But what one notioes Instantly is that these sorts of 
olroumstanoes, in whioh there is a dlreot correlation between 
the oharaoter of the stimulus situation and the oharaoter of 
the response, are highly atypical of language use. You have to 
put a person literally under physloal duress in order to get 
such a response, and then the question whether he responds 
appropriately or not is a function of how muoh duress he*s 
going to stand for, and not a question of his verbal habits. 




In a way | that same thing happens with the rat in the Skinner 
boxj the only way to get the rat to respond reliably is by 
making him very hungry. 

Now, I think that there’s a general consideration whloh 
underlies this that makes the whole Skinnerian business extremely 
uninteresting. That is, In any kind of ongoing behavior by any . 
organism --and especially language behavior by people— the 
aotual output Is a funotlpn of at least three things i the 
organism's knowledge of t)?e response system (the rat's know- 
ledge of how to press the bar, a person's knowledge of how to 
speak the language), plus the organism's utilities, plus what 
you could call the organism's local beliefs. By the rat's 
knowledge, in this case, I mean that he has the aotual muscular 
coordinations available to press the bar. He has certain 
utilities which come out in the characteristic Skinner situations, 
so that when he's hungry, he behaves reliably in aooordanoe with 
what we know to be his knowledge. And he has oertaln looal 
beliefs —that Is, what he has learned about the relation between 
hi 8 trained response and getting reinforced. Now, in the 
Skinnerian situation, you have known values for the rat's 
knowledge, utilities, and looal beliefs. The only belief 
that the organism could possibly develop in this situation is 
that pressing the bar leads to reinforcement, beoause the bar 
Is the only thing In the environments that is, the only 



available Induction for him to make is that it’s pressing the 
bar and not something else that leads to reinforoement, Given 
that these three are known values, you oan make extremely preolse 
predictions about behavior. 

But now look at the situation vis-a-vis language. If you 
think of language as literally a correlation between a stimulus 
situation and a response situation —if, for example, you regard 
language as a pieoe of knowledge that you should produce a 
certain response given a oertaln Input, then you have to look 
for correlations of this kind, and you don’t find them. The 
reason that you can't find them is intuitively evident— namely, 
what response a particular person gives to a particular stimulus 
at a particular time will depend on a variety of things other 
than his knowledge of the response system, that is, other than 
his knowledge of the language. It will depend on what his 
utilities are, what he believes, and so forth, and no psychology 
has anything to say about these kinds of conditions. So that 
if I walk up to a guy and say, "What’s the time?" and if I oan 
make the usual assumptions about his utilities and his local 
beliefs —his utilities will include things like politeness, and 
hi 8 looal beliefs will include the faot that his watoh isn't 
broken, and so on— then I can prediot what he’ll dot he'll 
look at his watch and say "3 o’clook," or whatever time it is. 

But if he happens to be drunk or angry, then he's likely to punoh 
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me In the nose. Certainly, whether or not he punohes me in the 
nose has absolutely nothing to do with whether he understands 
the question "What’s the time?" And that’s th9 Important part. 
What his response is oan throw ezaotly no light on the question 
of what he knows about the language. And that is a way of 
saying that there is no possibility of oonstrulng his knowledge 
of the language in terms of a disposition to prcduoe a oharao- 
teristlo response in a characteristic stimulus situation. 

This is the orltloal mistake i to suppose that what a 
speaker knows is that he is supposed to give a certain response 
in the appropriate stimulus situation. What I think is really 
going on is that the person’s knowledge of the language has 
some wildly different form from that which linguists, psycho- 
linguists, and phllosphers try to study; and the response is a 
funotlon of what I think are actually a lot of different 
variables. That the response is in English rather than 
Hindustani is obviously a funotlon of what he knows about the 
language, but whioh response he gives in a particular lnstanoe 
will depend not only on his knowledge of the language, but 
also on many other factors, lnoluding his utilities and beliefs. 

If you read Chomsky’s review of Verbal Behavior , and I 
highly recommend that you do, you get the feeling that there is 
a radical difference between language, whioh has the properties 
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of productivity and stimulus freedom, and what the rat does In 
a Skinner box. What I'd be inclined to say is that Skinner 
even gives a bad analysis of what the rat does. What we've 
done is put the organism in a situation where the only strategies 
available are brute foroe strategies. Any organism behaves 
differently when you watoh it behaving in its natural eoology, 
from the way it behaves when you watoh it in a Skinner box. 

This is olearly an impoverished situation. The only intelligence 
that the organism may have is reduced by the brute exlgenoles 
of the situation to an lnduotion about the results of pressing 
a bar. If, on the other hand, you watoh a rat building a nest, 
which they do, you get a quite different pioture of the oapaoitles 
of the animal, 

I think, in faot, that in this respeot there's no difference 
between a rat and a person, exoept that in the Skinnerian 
situation the rat is extremely hungry and is given a very 
limited kind of situation in whioh to form his looal beliefs. 

That is, if you put a rat into this completely controlled kind 
of environment, in which a) he wants food, and b) there's only 
one kind of manipulation he oan perform on the environment -*he 
can press a bar-- then the only conceivable induction he oan 
make is that he gets a pellet by pressing the bar. It may be 
that there are speoifio differences in the kinds of strategies 
that various organisms will try for this situation, but I'm 



inclined to doubt it, A super-smart rat might think to himself , 
"Well, there’s only one thing I oan do in this situation, so 
I’ll go do it," and a oharaoteristioally dumb rat mlg'r.c bang 
around until he runs into the bar. 

Looking at the rat's response ourve for this situation, 
you'd expect that if he were doing it by trial and error, you 
would get a smooth, slow approximation. In faot, that's not 
what happens at all. His response doesn't inorease at a smooth 
rate. What happens is that he does nothing at all for a while, 
and then he makes a couple of responses, and then the ourve 
really takes off. The suggestion is that the rat is doing 
exaotly what you're doing — ho sees what's going on. He sees 
that there' 8 some connection between pressing the bar and 
getting food, and he learns to handle the situation. So I 
think that it's dubious that there's any difference between 
the way a person would address this situation and the way a 
rat does. 

What I wish to argue now is that the basic reason that the 
Skinnerian system falls is that the prlnoiple of generalization 
has no oontent. Just as you have stimulus generalization, in 
which, for example, the habit of saying "glass" to a right-side- 
up glass extends to an upside-down glass, so also you have 
response generalization. Por Instance, if you train a rat in 



a box with the bar in one place and then put him in another box , 
with the bar in a different place, it will take him .much less t 
time to learn the task. Now, it is certainly true that for 
any of the skills ~ language of what the rat does in the Skinner 
box ~ the productivity of the skill is extremely important. 

Here there is a real fact to be explained, namely, that the 
organism, having learned one thing, transfers his training to 
something quite different. And the principle of generalization 
doesn't explain it. 

There is a well know experiment of a T maze, in which 
there* 8 a goal box in one direction which is plus, and a goal 
box in the other direction which is minus, and what you do is 
turn the rat around like this, and after a while, he does that. 
The rat starts out here, and he explores for a while, and 
eventually he finds the food here. 

Now what you do io flood the maze, so that the organism 
can't run it, but must instead swim in order to reach the goal. 
This entails a completely new set of motor responses. What 
happens is that the rat, previously trained to run the unflooded 
maze, successfully swims the flooded maze. Now, one might say 
that this is an Instance of response generalization. But . 
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notice what question is being begged. You have to say that 
swimming to the right and running to the right are sontfci.aw- 
more similar to each other than running right and running left, 
because, after all, this response generalises to swimming to 
the right, but not to running to the left. Khat's the principle 

M « . . 

of similarity that allows you to do that? Bow can you say that 

I ' , * 

running right and swimming right are more similar than running 
right and running left, when the latter pair in fact uses 
exaatly the same motor system, whereas the former pair uses ' 
different motor systems? 

Exactly the same kind of question arises about language* 

You might want to say that language is productive -- that you 
can understand sentences you've never heard before* Suppose 
you've been trained in the sentence 

(1) Mary went to school. 

How do you know that another sentence, which may be quite 
different, like 

(2) The boy who X used to know when X want golfing * 
in Chicago got sick 



is also a sentence f given that you weren't specifically trained 
In It? Vell f you night say that you just generalise from the 
former to the latter* Gut what Is the prlnolple of generalisation 
that says that (2) Is relevantly similar to (1) and so the 
training transfers) but the sentenoe 

(3) Sohool to went Mary 

1 8 not relevantly similar? If you don't spell out the notion 
of similarity) the prlnolple of generalisation Is vaouous* 

On the other hand, If you do try to spell out the prlnolples 
that underlie the faot that (2) but not (3) Is similar to (1) 
—namely, that (1) and (2) are grammatical sentences and that 
(3) isn't— then you get a very oomplex set of gramnatloal rules | 
what you got in effeot is a generative grammar. Thus, in a 
sense, the prlnolple of generalisation doesn't buy anything* 

There are a vast set of structures whioh the organism employs 
In determining what is similar. Using the prlnolple of 
generalisation to answer one question ..-how you transfer 
training from the struotures you've been trained on to new 
objeots— just opens up another question i how does the 
organism learn those prlnolples whioh determine whioh responses 
are similar and which are not? 

Slnllarly, suppose that I've been trained to oall a non* 
filter olgarette "cigarette." What Is the prlnolple of 
similarity that allows me to generalise this response, so that 
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I will also o&ll a filter olgarette "cigarette," but I will not ■* j 
oall a pieoe of ohalk "olgerette"? Certainly, a pleoe of ohalk 
is similar in some respeots to a olgarette \ the prlnolple of 
generalization has nothing to say about why it is not relevantly 
similar. 

Or, suppose you train an organism to respond positively to 
a oertaln note being played on the piano. If you look at the 
way he generalizes the response^ you may guess that he will 
respond positively to similar notes and not to dissimilar notes. 
What determines whether a note is similar to the note to which 
the organism has been trained to respond? Well, if you define 
similarity in terms of simple frequency, so that notes of 
similar pitoh will be relevantly similar, you turn out to be 
wrong. If you define similarity in tens of an ootave, so that 
notes whloh have different froquenoy but are In the same key 
are oonsldered relevantly similar, you turn out to be right. 

But the prlnolple of generalization, slnoe it is not equipped 
to tell yru what is similar and what isn't, has told you nothing. 

Or, suppose you train an organism to proas a bar when 
you show him a red triangle, but not when you show a green 
olrolet What will happen now if you show a green triangle or 

a red oirole? It is olear that the prlnolple of generalization 

* 

1 n of no help here} it oar. make no predlotlon, slnoe you don't 




2 . 4 . 



know what similarity means to the organism in question. (By 
the way, there Is no general answer in this case; whether the 
organism will respond to the green triangle or red oirole 
depends on many factors i its speoies, maturity, prior training, 
eto.) 



If you train a person to respond, positively to the word 
"toy,'* whioh of the following words will he tend to respond 
positively toi "toy" or "man"? Of oourse, the prinolple of 
generalisation oannot predlot, slnoe it provides no funotlon 
to tell you how the person arranges stimuli into similarity 
olasses (the answer is that ohildren in this oase tend to 
respond positively to "toy," adults to "manl). 

Chomsky, in his review of Verbal Behavior , says that 
language has two properties that oannot be aooounted for by 
Skinner* The first is productivity. If language is a habit, 
then it is a habit with infinitely many possible expressions. 
Slnoe you oan*t train an organism to do infinitely many tasks, 
at some point training has to stop and the organism has to 
take over and generalise. There is no aooount for this in 
the Skinnerian model. 

The second property of language that is left unaccounted 
for in the Skinnerian model is stimulus freedom. That is, what 



on* says Is not a simple funotlon of one's input* This point 
has been largely misunderstood. Chomsky has been taken as 
asserting that language behavior is not deterainistlo--that 
it is not physloally determined. This assertion may or may 
not be true, depending on your point of view, but it was not 
the point of Chomsky's remark. What he intended was that the 
response is a funotlon not only of the input but also of the 
Internal states of the organism. This, of oourse, is true not 
Just of language, but of Just about anything, 

I now wish to sketoh some lines of evldenoe that first 
language learning in humans involves Innate skills. To do 
this, I want to turn to an aspect of the language problem that 
was hardly notioed by the Skinnerians. When you hear a word, 
part of what you do in understanding it is figure out which 
word you're hearing. You are able to do this aoross many 
differences among speakers --speed of speech, whether a male 
or a female is talking, and so forth. How, in spite of these 
differences, do you manage to Identify a given word as that 
particular word, or a given sentenoe as that particular sentenoe? 
What the Skinnerians said Is that you are trained to reoogniee 
oertaln critloal aooustio features associated with a sentenoe 
that are in a one-to-one correspondence with the llngulatlo 
analysis. In effect, the Skinnerians said that speeoh perception 
is no problem, beoauae speeoh perception is equivalent to 



responding to aooustio oues. 

However, it is very unlikely that this is true. After 25 
years, we still haven* t been able to build a maohine whioh oan 
process the wave forms oontaining aooustio oues and interpret 
them as phonetio symbols. If speeoh recognition were just a 
matter of responding to aooustio cues, we should be able to 
build suoh a maohine. 

The reason that we haven* t been able to build a maohine 
to process continuous speeoh is the followingi the aooustio 
representation of a particular oonsonant is determined almost 
entirely by the vowel that it is attaohed to. Now, it is 
possible in some oases to have two distinot consonants, one 
of them before one vowel, and the other before a different 
vowel, where these distinot consonants have the same aooustio 
representation. This situation does not lead to oonfusion in 
your peroeption of these sounds) they are peroeptually 
distinguishable, but they are indistinguishable aooustioally 
if you look at their representations on a sound speotrogram. 

The reason that the two oonsonant sounds are peroeptually 
distinguishable is that in your head you have for eaoh sound 
a phonetio matrix of distinctive features. For example, you 
oan desoribe the sound u in terms of various features, suoh 

o 

ERIC 



n. 



as roundedness, voloing, oon sonant allty, oontinuousness, and 
so forth, where eaoh of these features has a partloular value* 
Eaoh distinctive feature matrix is in effeot a set of dlsorete 
Instructions at the phonetio level. Because of the limitations 
of your vooal apparatus, these dlsorete instructions get 
smeared all over the plaoe when it comes to pronouncing this 
sound in a word. Thus what you do in pronouncing this sound 
is go from a digital representation (the dlsorete phonetio 
matrix in your head) to an analog representation (that is, a 
continuous system employing your vooal apparatus, whloh is 
Inertial). 

Aotually, the situation Is somewhat worso than this. If 
you pronounoe the words "sloop" and "slip" you will notioe 
that some rounding ooours at the beginning of the word "sloop" 
even though the significant rounding oocurs later, in the 
vowel. So what you have is a situation where the aotual 
output --for example, the rounding of the s in sloop -- is 
determined not by the dlsorete set of instructions for the 
sound 8, but rather by a feature of a subsequent sound, in 
this oase, the vowel oo. So the instructions are not even 
transmitted sequentially in producing the sounds. 

Thus, to understand speeoh, you have to do two things* 
first, you have to compensate for the fact that the vooal 



apparatus is a noisy analog transducer, and seoond, you need 
non- sequential strategies for processing the signal, This is 
why we haven't been able to build a maohlne to do the Job* 

How, then, does the ohild learn to perform the seemingly 
complex operations Involved in processing speech? The above 
considerations make it difficult to maintain that he is trained - 
to do so, So the answer we ought to consider is that the Infant 
has Innately available to him a theory of relating dlsorete 
phonetlo matrices to a continuous inertial system, and a theory 
of non-sequential processing strategies, In faot, there is 
growing evldenoe that the ohild' s perceptual analysis, at least 
by the age of 3 or 4 months and perhaps even earlier, is 
essentially the same as the adult's, The idea is, then, that 
ohlldren are specially built to do this, rather than having 
to lear\ these oomplex theories, which at the present state of 
our knowledge we oan't even explain. 

3o here we afe dealing with a very highly specialised 
organism, which starts out with a great deal of highly structured 
information on what kinds of strategies to use on its Input, 

This view has two Immediate oonsequenoes for language teaohlng, 
First, teaohlng a ohild how to speak his first language is about 
as useful as teaohlng a bird to fly. In fact, there is even 
some amusing evldenoe that If you try to train the ohild, you* 11 



just slow him up. That depends on how you train him, but the 
things that have been tried so far have had negative ef foots* 

It appears that exposure to people talking language under 
normal oiroumstanoes is Just optimal for learning a first 
language. Second* if these Innate abilities are no longer 
available to the ohild after some age* say when he reaohes 
adolesoenoe —and there is some evldenoe for this— then you 
oanH draw any inferences for seoond language learning from 
first language learning. If this is true* then the only 
implication you can draw from this theory to language teaohlng 
is that this theory has nothing to say about language teaching, 

I think that this may be axaotly right. All that you read in 
the journals about scientifio techniques for seoond language 
teaohlng is absolute nonsense. There is nothing in current 
linguistics or psychology that gives any Indication about 
seoond language teaohlng. 

In faot, the situation might be worse than that. There Is 
real reason to suppose that every ohild learns his first 
language in the same way* independently of IQ, or cultures, or 
training, or what language it is, or anything of the sort. But 
there is no reason to suppose that seoond language teaohlng is 
one particular kind of task. How, if it is true that John learns 
a seoond language one way, and Kary another way, ard Peter does 
it 14 different ways, then the answer to the question "How do 



• you teaoh a language?" might be "Any way you like." Now, I'm 
not neoe8aarily saying that this Is neoessarlly truej what I an 
saying Is that, on the basis of the ourrent work In psychology, 
there Is no reason to believe that It's not true. 

There Is a belief that seems quite prevalent among language 
teaohers, that they need to have a theory In order to do their 
Job* But I don't agree* For example, If you're going to teaoh 
the skill of driving, you're not going to go about it by having 
a general theory of Internal oonbustlon engines or sonethlng 
and then deduolng theorems about driving from It* What you'll 
do Is buy some automobiles and take your students out on the 
road and say, "Drive. --I'll stop you If you do something wrong*" 
If someone olalms to have a fool-proof, solentlf ioally based 
theory of driving, he's a liar* But the laok of a theory 
doesn't prevent driving instruotlon from going on. Similarly, 
in my remarks here, I'm not suggesting that you stop teaohlng 
languages because there is no theory of language teaohlng* What 
I'm suggesting Is that we learn to live with the faot that there 
la no theory of language teaohlng at present. In the absenoe 
of a theory, what you have to do Is use jnur head. You make It 
up as you go along, and you try to do the best you oan, 



Someone might suggest that the reason that dlffsrent 
teaohers have different degrees of suoeess is attributable to 
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the sort of theory within whioh they are operating! but it may 
well be the oase that the theories are Independent of the results 
that follow. There is oortainly no known oorrelation between 
the theory and the effeotlveness of its application, 

I wish to stress the faot that just because linguistics 
and psyohology have nothing to say about the methods used in 
foreign language teaohlng t it does not follow that there may 
not be some methods of teaohlng that are better than others. 

X don*t think that these methods will oome out of the work 
being done in linguist ios or psyohology , but this is not to say 
that suoh methods oan't be found. 

Let me oonolude with a general remark. Classioallyi In 
the view of people like Plato and Spinoza, education pretty 
muoh consisted in applied ethios. That is, the question you 
asked was, "What should kids learn and what shouldn’t they 
learn, given that they oan’t learn everything?" Fairly 
reoently, einoe the time, say, of Dewey, a new view of education 
oame into prominence- -namely, that education is more or less a 
branch of applied psychology, that it should be oonoerned 
primarily with teohnique, X don’t agree with this view at all* 

A lot of olassroom procedures have been proposed for whioh there 
is not the least bit of evidenoe that they are preferable to 
other oonoeivable procedures. For example, there is an 
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unqueationed belief In the importance of the student-teaoher 
ratio, but not a shred of evldenoe has been offered to show 
that this has anythin,? to do with what the students learn. 

In faot, It Is a completely open question whether any teaoher 
Is better than none. Basloally I believe that there Is no 
reason to think that there Is a theory that oan provide a 
technique for teaohlng, and that an undue amount of time has 
gone Into searohlng for suoh a theory. 
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